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OF BIODATA 
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Field Team at Flight Research Center 

and 

Charles E. Lewis, Jr. 

Flight Research Center 


SUMMARY 


Before any data are statistically analyzed, it is always necessary to edit the data 
to some extent. Furthermore, when large quantities of data are collected, the editing 
must be performed by automatic means. One common task in the editing process is 
the identification of observations which deviate markedly from the rest of the sample, 
commonly known as outliers. A simple statistical technique for identifying the outliers 
and the necessary computer program is presented in this report. The program re- 
quires as input only the data set, sample size, and preselected levels of significance 
at which outliers are to be identified. It is assumed that the data set is a random 
sample of size larger than two from a normal population. Two examples are presented 
to illustrate applications of the described technique. 


INTRODUCTION 


The NASA Flight Research Center is engaged in an extensive research and develop- 
ment program aimed at advancing the state of the art in medical monitoring of humans 
in flight (ref. 1). Under this program, more biomedical information is collected in 
flight than is collected under the sum of all other known flight programs. An effort of 
this magnitude depends entirely on the Flight Research Center’s capacity for collecting, 
reducing, and analyzing these data by automatic means, including development of new 
techniques for accomplishing this work. No matter how sophisticated the monitoring, 
collection, and reduction systems, some editing of the biodata is required before they 
can be analyzed statistically. 

The reduced biodata may contain observations that deviate markedly from the rest 
of the sample, or from the trend of the data set. Such measurements are termed 
outlying observations, or outliers. An outlier may be subject to errors other than the 
usual random fluctuations characterizing the population to which the data belong, or 
may merely occur too infrequently to be considered in a particular analysis. Most of 
the statistical tests which are available to detect and decide whether an outlier is too 



rare to be acceptable are either too complex or too cumbersome for general application. 
One method suggested originally by Cram dr (ref. 2) is derived in this paper and its use 
demonstrated. This method was chosen because of its simplicity and easy applicability 
in editing biodata. A computer program for automatic editing was written in FORTRAN 
IV, based on Cramer's suggestion. The input to the program is sample data, sample 
size, and a preselected level of significance. The data sample is either a set of obser- 
vations or the deviations of the observations from a model, whichever is appropriate. 
Before testing the outliers, the program computes and prints the mean and standard 
deviation. After the test is performed, the data are printed with each outlier identified 
by an asterisk. Then the mean, standard deviation, and sample size are recalculated 
and printed, excluding the outliers. Four repetitions of the test may be made for dif- 
ferent levels of significance. 

Two examples are presented. In the first example the observations meet the 
assumption of random sample from a normal population. In the second example the 
input data are deviations of the observations from a given model. 

The program source listing, with instructions, and sample problems are presented 
in appendixes A to C. For convenience in manual computations, a related probability 
table of values (table I) is included. 


SYMBOLS 


F(x) distribution function at x 

f r (x) density function of t at point x 


I 


i 


summation starting from 
integers between 1 and 


i through k, where i and k 
n, and i is less than k 


are 


n 


sample size 


s = 



(standard deviation of sample) 


t 


random variable of Student’s distribution 


t / 


random variable related to t distribution 


x arithmetic average of sample values 

x 1 ,x 2 , . . . ,x n elements of observed sample 
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r 


y 1 »y 2 » • • • >y n orthogonalized variables obtained from Xp x^, .... x Q 

a level of significance between 0 and 1 

T(r) gamma function at r. If r is an integer T(r) = (r - 1)! 

cr standard deviation 

t random variable obtained from Student’s t distribution 

t* value of t at a level of significance 

BRIEF TEST DESCRIPTION 


The outlier test consists of computing a parameter t* for the data set and a value 
Tj for each member in the data set, comparing each with the t*, and identifying the 

members with a greater than r*. The set parameter t* is a function of the 

sample and the level of significance at which outliers are to be identified. Each r . is 

a function of the value of the member and the mean and standard deviation of the data 
set. As previously stated, the members must constitute a random sample of size 
larger than two from a normal population. 


DERIVATION OF TEST 


Let x-p x 2 x n be a random sample of size n from a normal distribution 

with mean 0 and standard deviation ex. Then the variable 



has a Student’s central t distribution, but the variable 

X 1 

T = 



L 



does not because here the numerator and denominator are not independent. Moreover, 
t" £ n; therefore, the distribution of r has a zero probability outside the interval 
(- VTT7 Vn). Defining 


n - 1 t _ r yn - 1 

yfi-T ' r7T? 


it is seen that V is distributed as a central t with (n - 1) degrees of freedom. From 
this the density function of t is obtained as 

! r<|) x 2 

f (x) = — (!-■=-) for |x| S Vn" 

T Vni n 1 ^) n 

for n >2. Replacing the x A variables with new y. variables by means of an 
orthogonal transformation such that the first two are 




and 


it is found that 


Consequently, the variable 


y 1 = Vn x 



which expresses the deviation of the sample value x^ from the sample mean x in 
terms of measured units of the standard deviation of the sample, becomes 



The variables y^, i = 2. . . , n are independent and normally distributed with 0 mean 
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and standard deviation < 7 . The variable 


T lv / n ~ 2 

Vn - 1 - Tj 2 

is then distributed as a Student’s t with (n - 2) degrees of freedom. These results 
hold irrespective of the value of the mean, and for any relative deviation 

- x 
s 


PROGRAM APPLICATIONS 


Given a sample of data x-p . . .x n> compute the sample estimate of the mean 


n 

=iy x , 

n^ 1 


_ 1 
x 


and 


If 2 - 2 

Z* Xj - nx 


s =1 


n 


Then compute critical values of t* by the relation 


-* = 


t >/n - 1 


y/n - 2 + t^ 


where t is the Student’s central t-value at a level with (n - 2) degrees of freedom 
and n is the sample size. Next compute values for n sample points 


T i = 



If 



> T* 


then Xj is considered an outlier at significance level a. 

The program (appendix A) follows this method to detect and identify (by *) the 
outliers. An option of the program allows the user to repeat the test for different 
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levels of significance. The required input parameters are as follows: 

1. Format of the data to be read. 

2. The sample size n. 

3. The significance level a values. 

4. The data format is as specified. 

The input procedure is completely described in the comments at the beginning of the 
source listing (appendix A). 


EXAMPLES 


Example 1 

One-minute heart rates from a 71-minute flight piloted by a student pilot from the 
Aerospace Research Pilot School at Edwards Air Force Base are used to illustrate the 
described method of editing for outliers. The heart rates in this example satisfy the 
assumption of random sampling from a normal population. The sample output, in- 
cluding the outliers marked by asterisks and the standard deviations, is shown in ap- 
pendix B. The results of the analysis are shown in the following table: 


Significance level, percent 

10 

5 

1 

0. 1 Original 
data 

Reduced sample size 

61 

67 

70 

71 71 

Standard deviation of reduced 
sample 

4. 593 

5.711 

6.371 

6.749 6.749 


Example 2 


In this example the outlier editing program is used on a flight profile from the 
Limited Vision Landing Accuracy Study (commonly known as "Cyclops") in progress at 
the Flight Research Center. The observations are the altitudes of the aircraft at fixed 
distances from the touchdown target point on the runway. The flight profile model is 
assumed to be a second -degree polynomial. The deviations of the observations from 
this flight profile are the inputs to this program. These deviations satisfy the require- 
ments of random sampling from a normal population. The computer output identifying 
outliers with asterisks is presented in appendix C. The flight profile and outliers are 
shown in figure 1. 
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Altitude of 
aircraft, 
feet 


Distance from touchdown target point, meters 
-250 -200 -150 -100 -50 0 



Altitude of 
aircraft, 
meters 


Figure 1. - Observations of aircraft altitude variation with distance from the runway touchdown 
target point for the Limited Vision Landing Accuracy Study. 

The results of the analysis at four levels of significance are presented in the following 
table: 


Significance level, percent 

10 

5 

1 

0. 1 

Original 

data 

Reduced sample size 

33 

34 

34 

34 

35 

Standard deviation of reduced 
sample 

0.453 

0.475 

0.475 

0.475 

0.564 


REMARKS 


Although the method of detecting and identifying the outliers described in this 
paper is being used for biodata editing at the Flight Research Center, its applicability 
in other areas is obvious. Results indicate that statistical analyses after removal of 
outliers have a lower standard deviation than when the outliers are not removed. The 
sample sizes are usually larger than 30, so reduction in sample size did not affect the 
validity of the results. 


Flight Research Center, 

National Aeronautics and Space Administration, 
Edwards, Calif., March 25, 1969, 
127-51-03-01-24. 
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APPENDIX A 


PROGRAM SOURCE LISTING 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


OUTLIE 

PURPOSE 

TEST EXTREME VALUES OF AN ARRAY 
METHOD 

LET X(l) THRU X ( NF I N ) BE DISTRIBUTED N(0t5> THEN 
T = X / SORT ( 1 /N* SUM ( X **2 ) 

WHERE SUM EXCLUDES VALUE X IN THE NUMERATOR 
N = SAMPLE SIZE 

TAU = <T*S0RTlN-l))/S0RT(N-2+ T**2) 

TO TEST OUTLIERS CALCULATE TAU(OBS) = ( X-U ) /S 

WHERE 

S**2 = 1/N*SUM( (X-U)**2> 

U = MEAN 

IF ABS(TAUtOBS) ) > TAU t X IS AN OUTLIER 
REFERENCE 

MATHEMATICAL METHODS OF STATISTICSt A. CRAMER 1961 

DESCRIPTION OF PARAMETERS 

X = ARRAY ASSUMED TO B N(U,S) 

NLIM - NUMBER OF X VALUES 

T = STUDENT'S T-VALUE AT ALPHA LEVEL AND N-2 DEGREES 
OF FREEDOM 

INPUT 

CARD 1 FORMAT OF X ARRAY READ IN (20 A4) 

CARD 2 

COL 1 THRU 5 NLIM = NO. OF X VALUES 

COL 1C NT = NO. OF ALPHA VALUES 

COL 11 THRU 15 ALPHA 1 

COL 16 THRU 20 ALPHA 2 

COL 21 THRU 25 ALPHA 3 

COL 26 THRU 30 ALPHA 4 

LIMIT OF 4 ALPHA VALUES WHICH MUST BE 
EQUAL TO 0.10, 0.05, 0.01, OR 0.001 
OTHER VALUES WILL NOT BE ACCEPTED BY THE 
PROGRAM 

CARD 3 THRU . . . 

DATA FORMATED AS PRESCRIBED IN CARD 1 
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ononnooooooo 


OUTPUT 

1 MEAN 

2 STANDARD DEVIATION 

3 LIST OF DATA WITH THE OUTLIERS IDENTIFIED BY AN ASTERISK 

4 MEAN WITH OUTLIERS DELETED 

5 STANDARD DEVIATION WITH OUTLIERS DELETED 

6 SAMPLE SIZE WITHOUT THE OUTLIERS 

NOTE STEPS 3 THRU 6 ARE REPEATED FOR EACH LEVEL OF ALPHA 
USED. 


DIMENSION X( 1000) , I TC( 1000 ), ALPHA ( 5 ) * FMTI20), TVALI5) 

REALMS SUMI, SUM2 » SUMNEW, SUMN2 
DATA IDUM1/4H /.IDUM2/4H* / 

1 SUMI = 0.0 
SUM2 = 0.0 

READ(1 ,100,END=999) FMT . NLIM, NT , I ALPHA! J ) ♦ J= 1 , NT ) 

100 FORMAT! 2CA4/2I 5.4F5.3) 

WR I TE ( 3 » 200 ) FMT 

200 FORMAT! 1H1.30X, *P ROGRAM TO TEST OUTLIERS 
l'///' FORMAT OF INPUT X VALUES IS '20A4J 
READ!1»FMT)!X( I).I=1»NLIM ) 

DO 20 1=1, NLIM 
SUMI = SUMI + X! I) 

20 SUM2 = SUM 2 + X(I)*X(I) 

SDE V = ( SUM2 -<SUM1*SUM1)/NLIM)/(NLIM) 

SDE V = SORT ( SDE V ) 

XMEAN = SUM1/NLIM 
ALIM = NLIM 

WRITE ( 3* 160) XMEAN. SDEV 

160 FORMAT ( 1H . ’ MFAN =*F10.4/» STANDARD DEVIATION = ', 

IF 1 0 .4 ) 

DO 30 J= 1 .NT 

CALL STUDNT! TVAL! J) .NLIM, ALPHA ( J ) ) 

30 TVAL(J) = (TVAL! J ) *SQRT < AL I M-l . ) )/SQRT( ALIM-2. +TV AL ( J ) *TV AL ( J ) ) 

DO 70 J = 1 . NT 
WRITE (3,130) 

130 FORMAT! 1H1, ‘LIST OF INPUT VALUES ' //9X ,' N' , 14X ,' X VALUE ', 9X ,' N ', 13X 
A ,'X VALUE'/) 

SUMNEW = SUMI 
SUMN2 = SUM2 
NCOUNT = NLIM 
DO 60 1= 1, NLIM 
150 FORMAT ( 1H1 ) 

ITC ( I ) = IDUM1 
TAU = (X(I) - XMEAN) /SDEV 
IFIABS(TAU) .LE.TVAL(J) ) GO TO 90 
ITC! I ) = I0UM2 
SUMNEW = SUMNEW— X ( I ) 

SUMN2 = SUMN2 -X(I)*X(I> 

NCOUNT = NCOUNT -1 
90 CONTINUE 
60 CONTINUE 

WRITE(3»110) ( I »X( I ) , ITC! I ) ,I=1»NLIM) 

110 FORMAT! IX, I 10.F2C.5.A1, I9.F20.5.A1) 

WRITE13.1 20 ) ALPHA ( J ) 
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120 FORMAT ( 1H0 » •* INDICATES OUTLIER VALUE AT *F5.3,' SIGNIFICANCE LEV 
IE L ' ) 

EXPT = SUMNEW/NCOUNT 

SDEV1 = (SUMN2-! SUMNEW*$UMNE W) /NCUUNT ) / ( NCOUNT ) 

SDF V 1= SORT ( SDE VI ) 

WRI TE l 3 , 1 70 ) EXPT, SDEV1, NCOUNT 

170 FORMAT ( ' CPOPULA TION PARAMETERS WITH OUTLIERS DELETED* /• MEAN 

1 = * F 10 . A/ * STANOARD DEVIATION ='F10.4/' N 

2 = * I 5// / ) 

70 CONTINUE 

GO TO 1 
999 RFTURN 
END 

SUBROUTINE STUDNT < T , N . A ) 

DIMENSION TABLE ( 4, 34),LTAB< 4!,ATAB(4) 

DATA LTAB / 30.40,60,120 / ,A TAB/O . 1 0 ,0 . 05 , 0 . 01 , 0 .00 1/ 

DATA TABLE/6. 314, 12. 706, 63. 65 7, 636. 619, 2. 920, 4. 30 3, 9. 92 5, 31. 5 98, 
12.353,3.182,5.841, 12.924,2. 13 2 , 2 . 776 , 4 . 604 , 8 . 6 10, 2 . 0 1 5 , 2 . 571 , 4 . 032 
2, 6.869, 1.943,2.447,3. 707,5. 559, 1.895,2. 365,3.499,5.408, 1.860, 2.306 
3, 3.355, 5. 04 1,1. 83 3, 2. 262,3 .250, 4. 78 1,1. 812, 2. 22 8, 3. 169, 4. 587, 1.796 
4.2.201,3. 106,4.437,1.782,2. 179,3.055,4.318,1.771,2. 160,3.012,4.221 
5. 1.76 1,2. 145, 2. 977, 4. 140,1. 75 3, 2. 131, 2. 94 7, 4. 073, 1.746,2.120,2.921 
6,4.015, 1.740,2.110,2.898,3.965,1.734,2.101,2.878,3.922,1.729,2.093 
7, 2. 861, 3. 883, 1 . 725 , 2 . 086 , 2 . 84 5, 3. 85 0,1. 72 1,2. 080, 2. 831, 3. 819, 1.717 
8, 2.074, 2. 819,3. 792, 1 . 71 4 , 2 . 069 , 2 . 80 7 , 3 . 767 , 1.711,2.064,2.797,3.745 
9, 1. 70 8, 2. 060,2. 78 7,3.72 5, 1 .70 6, 2. 056, 2. 779,3.707, 1 . 70 3 , 2 . 052 , 2 . 77 1 
A, 3. 690, 1.701,2.04 8,2.76 3,3.674,1.69 9,2.045,2. 756, 3. 659, 1.697, 2. 042 

1, 2. 750, 3.646, 1.6 84,2.021,2 . 704 , 3 . 55 1 , 1 . 671 , 2 . 000 , 2. 660,3.460, 1 .658 

2, 1.980.- 2. 61 7, 3. 3 73, 1.645,1 . 960,2.576,3.291/ 

DO 10 1=1,4 

I F ( A . EO . A TAB ( I ) ) GO TO 20 

10 FONT I NUF 
WRI TE ! 3 » 1 ) A 

1 FORMAT ( 1H0, ' AL°HA VALUE = *F5.4,* IS NOT A TABLED VALUE* ) 

WRI TE ( 3 , 167 ) 

167 FORMAT ( 1H ,' ALPHA SET EQUAL TO 0.10') 

I = 1 
20 11= I 

J F ( N.LE.30) GO TO 40 
DO 30 I = 2,4 
I F ( N-L T AB ( I ) 150,40,30 
30 CONTINUE 

INTERPOLATE WITH DF 120, INFIN 

T = TABLE ( 11,34) + ( 1 . 0/N ) / { 1 . 0/L TAB ( 4 ) ) *< T ABLE! 1 1 , 33 )— TABLE ( 1 1 , 34 

1) ) 

RETURN 

40 T = TABLE! II, N) 

RETURN 

50 T = TABLE! I 1 , 1+29 ) + (1.0/N -1 . 0/ LT AB ( I ) ) / ( 1 .0 /LTAB ( I- 1 ) -1 .0/L T AB ( 

1 I n«( TABLE! 11, 1+28 ) - TABL E( 1 1 , 1+29 ) ) 

RETURN 

END 



APPENDIX B 


OUTPUT FOR EXAMPLE 1 


PROGRAM TO TEST OUTLIERS 

FORMAT OF INPUT X VALUES IS (14F5.3) 

MEAN = 98.5042 

STANDARD DEVIATION = 6.7494 

LIST OF INPUT VALUES 


N 

X VALUE 

N 

X VALUE 

1 

101.20000 

2 

95.20000 

3 

8 5.7 99 9 9# 

4 

82.20000* 

5 

96. 70000 

6 

95.29999 

7 

98.29999 

8 

101.59999 

9 

94.89999 

10 

90.79999 

11 

92.2 00 CO 

12 

97.00000 

13 

95.29999 

14 

94.79999 

15 

104.20C0C 

16 

118.20000* 

17 

104.29999 

18 

110.50000* 

19 

1 C 7 . 50000 

20 

104.70000 

21 

104.29999 

22 

103.39999 

23 

104. 700G0 

24 

101.59999 

25 

1C 2. 2 00 GO 

26 

105.39999 

27 

103. 50CC0 

28 

98.C0C00 

29 

105.C0000 

30 

103.39999 

31 

104.00000 

32 

97.89999 

33 

ICO. 39999 

34 

99.29999 

35 

ICC. 89999 

36 

98.79999 

37 

99.89999 

3 8 

106.39999 

39 

96.29999 

40 

102.7C00G 

41 

105. 09999 

42 

110.29999* 

43 

101 . 89999 

44 

96.89999 

45 

96. 5 0 0 0 C 

46 

96.29999 

47 

95. 7000 C 

48 

98.0 0000 

49 

90. 29999 

50 

88.89999 

51 

97.29999 

52 

86.2000C* 

53 

86 • 50000* 

54- 

96.29999 

55 

97.09999 

56 

96.500 JO 

57 

89. 0000 C 

58 

84.70000* 

59 

97.00000 

60 

68.59999 

61 

97. 79999 

62 

95.20000 

63 

98.39999 

64 

112.79999* 

65 

111. 09999* 

66 

97.00000 

67 

95. 50000 

68 

97.5C00C 

69 

93.00000 

70 

94.59999 

71 

93.000CC 




* INDICATES OUTLIER VALUE AT 0.100 SIGNIFICANCE LEVEL 

POPULATION PARAMETERS WITH OUTLIERS DELETED 
MEAN = 98.4508 

STANDARD DEVIATION = 4.5932 

N =61 
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LIST OF INPUT VALUES 


N 

X VALUF 

N 

X VALUE 

1 

101.20000 

2 

95 .20000 

3 

85. 79999 

4 

82 • 20000 * 

5 

96.70000 

6 

95.29999 

7 

98.29999 

8 

101.59999 

9 

94. 89999 

10 

90.79999 

11 

92.20000 

12 

97.00000 

13 

95.29999 

14 

94.79999 

15 

104.20000 

16 

118.20000* 

17 

1C4. 29999 

18 

110.50000 

19 

107. 50000 

20 

104.70000 

21 

104.29999 

22 

1U3. 39999 

23 

104. 730C0 

24 

101.59999 

25 

102.20000 

26 

105.39999 

27 

103.50000 

28 

98.00000 

29 

105. 00000 

30 

103.39999 

31 

104. COCO 0 

32 

97.89999 

33 

100.39999 

34 

99.29999 

35 

ICC. 89999 

36 

98.79999 

37 

99. 89999 

38 

106.39999 

39 

96.29999 

40 

102.70000 

41 

1C5. 09999 

42 

110.29999 

43 

101 . 89999 

44 

96.89999 

45 

96. 50C00 

46 

96.29999 

47 

95. 7000C 

48 

98.00000 

49 

90.29999 

50 

88.89999 

51 

97.29999 

52 

86.20000 

53 

86. 50C0C 

54 

96.29999 

55 

97.09999 

56 

96.50000 

57 

89.00000 

58 

84.70000* 

59 

97.0C0CC 

60 

68.59999 

61 

97. 79999 

62 

95.20000 

63 

98. 39999 

64 

112.79999* 

65 

111. 09999 

66 

97.00000 

67 

95. 50000 

68 

97.50000 

69 

93.00000 

70 

94.59999 

71 

93.00000 




* INDICATES OUTLIER VALUE AT C.05C SIGNIFICANCE LEVEL 

POPULATION PARAMETERS WITH OUTLIERS DELETED 
MEAN = 98. 4463 

STANDARD DEVIATION = 5.7113 

N =67 



L T ST OF INPUT VALUES 


N 

X VALUE 

N 

X VALUE 

1 

101.20000 

2 

95.20000 

3 

85. 79999 

4 

82.20000 

5 

96.70000 

6 

95.29999 

7 

98.29999 

8 

101.59999 

9 

94. 89999 

10 

90.79999 

LI 

92.20000 

12 

97.00000 

13 

95. 29999 

14 

94.79999 

15 

104.20000 

16 

118. 20 000* 

17 

104.29999 

18 

110.50000 

19 

107.50000 

20 

104.70000 

21 

104.29999 

22 

103.39999 

23 

104. 70000 

24 

101.59999 

25 

102.20000 

26 

105.39999 

27 

103.50000 

28 

98.00000 

29 

105.00000 

30 

103.59999 

31 

104.00000 

32 

97.89999 

33 

100.39999 

34 

99.29999 

35 

ICO. 89999 

36 

98.79999 

37 

99. 89999 

3 8 

lu6. 39999 

39 

96.29999 

40 

102. 700 OC 

41 

1C5. 09999 

42 

11 j. 29999 

43 

101.89999 

44 

96.89999 

45 

96. 5 0 0 0 C 

46 

96.29999 

47 

95. 70000 

48 

98 . UGCQC 

49 

90.29999 

50 

83.d9999 

51 

97.29999 

52 

86.2000C 

53 

86. 50CC0 

54 

96.29995 

55 

97.09999 

56 

96.50000 

57 

89.00000 

58 

84 . 700 Ou 

59 

97.00000 

60 

88.59999 

61 

97. 79999 

62 

95.2C00C 

63 

98.39999 

64 

11^.79999 

65 

111.09999 

66 

97. 0006 C 

67 

95.50000 

68 

9 7 . 5 GOO C 

69 

93.00000 

70 

94.59999 

71 

93.00000 




* INDICATES OUTLIER VALUE AT O.OIO SIGNIFICANCE LEVEL 

POPULATION PARAMETERS WITH OUTLIERS DELETED 
MEAN = 98.2228 

STANDARD DEVIATION = 6.3706 

N =70 



LIST OF INPUT VALUES 


N 

X VALUE 

N 

X VALUE 

1 

101.20000 

2 

95.20000 

3 

85.79999 

4 

82.2COOC 

5 

96.70000 

6 

95.29999 

7 

98.29999 

8 

101.59999 

9 

94. 89999 

10 

90.79999 

11 

92.20000 

12 

97 .00000 

13 

95. 29999 

14 

94.79999 

15 

104. 20000 

16 

118.2C000 

17 

104.29999 

18 

110.50000 

19 

107. 5000C 

20 

104 .70000 

21 

104.29999 

22 

103.39999 

23 

104. 7000 C 

24 

101. 59999 

25 

102.20000 

26 

105.39999 

27 

1C3.5000C 

28 

98.00000 

29 

1C5. 00000 

30 

103.39999 

31 

104.00000 

32 

97.89999 

33 

100. 39999 

34 

99.29999 

35 

100. 89999 

36 

98.79999 

37 

99. 89999 

38 

106.39999 

39 

96.29999 

40 

102.70000 

41 

105.09999 

42 

110.29999 

43 

101.89999 

44 

96.89999 

45 

96. 50CC0 

46 

96.29999 

47 

95.70000 

48 

98.0 COCO 

49 

90.29999 

50 

88.89999 

51 

97.29999 

52 

86.20000 

53 

86.50000 

54 

96.29999 

55 

97.09999 

56 

96.50000 

57 

89.00000 

58 

84.70000 

59 

97.00000 

60 

88.59999 

61 

97. 79999 

62 

95.200CC 

63 

98.39999 

64 

112.79999 

65 

111. 09999 

66 

97.00000 

67 

95. 50000 

68 

97.50000 

69 

93.00000 

70 

94.59999 

71 

93.000 OC 




* INDICATES OUTLIER VALUE AT 0.001 SIGNIFICANCE LEVEL 

POPULATION PARAMETERS WITH OUTLIERS DELETED 
MEAN = 9R.5042 

STANDARD DEVIATION = 6.7494 

N - 71 



APPENDIX C 


OUTPUT FOR EXAMPLE 2 


PROGRAM TO TEST OUTLIERS 

FORMAT OF INPUT X VALUES IS (14F5.3) 

MEAN = -0.0002 

STANDARD DEVIATION = 0.5636 

L T ST OF INPUT VALUES 


N 

X VALUE 

1 

0. 250C0 

3 

0.47000 

5 

0.38100 

7 

0.18700 

9 

-0.77000 

11 

-0.46500 

13 

-0. 107CC 

15 

-C. 53800 

17 

0. 06800 

19 

1.82600* 

21 

0.45000 

23 

0.33600 

25 

0.41800 

27 

0. 14000 

29 

-0.885G0 

31 

-0. 65300 

33 

-0.249CC 

35 

-0.06600 


N 

X VALUE 

2 

0.47400 

4 

0.39700 

6 

0.34300 

8 

-0.13300 

10 

-0.58000 

12 

-0.24300 

14 

-0.29200 

16 

-0 .06500 

18 

-0.31000 

20 

0.90500 

22 

0.01300 

24 

0.654G0 

26 

0.08000 

28 

-0.989GC* 

30 

-0.885CC 

32 

—0 .46500 

34 

0.29700 


* INDICATES OUTLIER VALUE AT 0.10C SIGNIFICANCE LEVEL 


POPULATION PARAMETERS 
MEAN 

STANDARD DEVIATION = 

N 


WITH OUTLIERS DELETED 
-0.0255 
0.4533 
33 



LIST OF INPUT VALUES 


N 


X VALUE 

N 

X VALUE 

1 


0.25000 

2 

0.47400 

3 


0.47000 

4 

0.39700 

5 


0. 38100 

6 

0.34300 

7 


0.13700 

8 

-0. 133GC 

9 


-0 . 770 C 0 

10 

“0 .58000 

11 


-0.45500 

12 

-C. 24300 

13 


-C. 107C0 

14 

-0.29200 

15 


-0. 53800 

16 

-0 .Cb50C 

17 


0.0 6800 

18 

-0 . 31000 

19 


1.82600* 

20 

C.9Q50G 

21 


0.45000 

22 

C.C130C 

23 


0.3 360C 

24 

0.65400 

25 


0.418C0 

26 

0 .08OOO 

27 


C. 14000 

28 

-0 .98900 

29 


-0.885C0 

3C 

— 0 . 8 85 0 C 

31 


-G.6530C 

32 

-0.46500 

33 


-0. 249GC 

34 

0.2970C 

35 


-C • 066 CC 



* INDICATES 

OUTLIER VALUE AT 0.050 

SIGNIFICANCE 

LEVEL 

POPULATION 

PARAMETERS 

WITH OUTLIERS 

• DELETED 


MEAN 

= 

-0.0539 



STANDARD DEVIATION = 

0.4754 



N 

= 

34 



LIST OF INPUT VALUES 




N 


X VALUE 

N 

X VALUE 

1 


C. 25000 

2 

0.47400 

? 


0.47000 

4 

0.39700 

5 


0.33100 

6 

0 . 3 4 3 0 G 

7 


C. 187C0 

8 

-G. 13300 

9 


-0.77000 

10 

-0.58000 

11 


-0.46500 

12 

-0.24300 

13 


-0. 10700 

14 

-0.29200 

15 


-C. 53800 

16 

-0 .06500 

17 


C.0680C 

18 

-0.31000 

19 


1.826C0* 

20 

0.90500 

21 


0.45CCC 

22 

0.0130C 

23 


0.33600 

24 

0.654C0 

25 


C. 41800 

26 

0.0800C 

27 


0. 14000 

28 

— 0 . 9 890C 

29 


-0. 8350C 

30 

-0 .88500 

31 


-0.65300 

32 

-0.46500 

33 


-0.24900 

34 

0 . 2 9 7 u 0 

35 


-0.06600 



* INDICATES 

OUTLIER VALUE AT 0.010 

SIGNIFICANCE 

LEVEL 

POPULATION 1 

PARAMETERS 

WITH GUTLTERS 

DELETED 


MEAN 

= 

-0.0539 



STANDARD DEVIATION = 

0.4754 



N 

= 

34 





LIST OF INPUT VALUES 


N 

X VALUE 

N 

X VALUE 

1 

0.25000 

2 

0.4740C 

3 

0.470CC 

4 

0.39700 

5 

0.38100 

6 

0.34300 

7 

0. 1870C 

8 

-0.1 3300 

9 

-0.77000 

10 

-0.58000 

11 

— 0 . 465 OC 

12 

— C • 243uC 

13 

-0.10700 

14 

-0.29200 

15 

-0.53800 

16 

-0.0650 C 

17 

0.06800 

18 

-0.3100C 

19 

1.82600* 

20 

0.90500 

21 

0.45000 

22 

0.01300 

23 

C. 33600 

24 

0.65400 

25 

0.41800 

26 

0.08000 

27 

0. 14000 

28 

-0.98900 

29 

-0.88500 

30 

-0.88500 

31 

-0.65300 

32 

-0.46500 

33 

-0.24900 

34 

0.29700 

35 

-0 . 06600 




* INDICATES OUTLIER VALUE AT C.OOi SIGNIFICANCE LEVEL 


POPULATION PARAMETERS 
MEAN 

STANDARD DEVIATION = 

N 


WITH OUTLIERS DELETED 
-0.0539 
0.4754 
34 


17 
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TABLE I.- UPPER PERCENTAGE POINTS OF THE t DISTRIBUTION 



n 

7 ** | 

a = 0.05 

a = 0.01 

a = 0.001 

3 

1. 410 

1.414 

1.414 

4 

1. 645 

1.715 

1. 730 

5 

1.757 

1.918 

1. 982 

6 

1. 814 

2.051 

2. 178 

7 

1. 848 

2. 142 

2.329 

8 

1. 870 

2.207 

2.447 

9 

1. 885 

2.256 

2. 540 

10 

1.896 

2.294 

2. 616 

12 

1. 910 

2.348 

2. 730 

14 

1.920 

2.385 

2. 812 

16 

1. 926 

2.411 

2. 873 

18 

1. 931 

2.421 

2. 921 

20 

1. 934 

2.447 

2. 959 

25 

1. 940 

2.474 

3. 026 

30 

1. 944 

2.492 

3. 071 

40 

1. 948 

2.514 

3. 127 

60 

1.952 

2. 535 

3. 182 

120 

1. 956 

2. 555 

3.237 

30 

1. 960 

2. 576 

3.291 


NOTE: To calculate r* values for sample sizes or significance levels 
not shown, use the relationship 


* tVn - 1 

T* = — — 

\/n - 2 + t 2 

where t is StudenUs t -value at significance level a and n - 2 degrees 
of freedom and n is sample size. 


NASA-Langley, 1969 
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